Parallel Inner Product-Free Algorithm for Least Squares Problems
نویسنده
چکیده
The performance of CGLS, a basic iterative method whose main idea is to organize the computation of conjugate gradient method applied to normal equations for solving least squares problems. On modern architecture is always limited because of the global communication required for inner products. Inner products often therefore present a bottleneck, and it is desirable to reduce or even eliminate all the inner products. Following a note of B. Fischer and R. Freund 11], an inner product-free conjugate gradient-like algorithm is presented that simulates the standard conjugate gradient by approximating the conjugate gradient orthogonal polynomial by suitable chosen orthogonal polynomial from Bernstein-Szegg o class. We also apply this kind of algorithm into normal equations as CGLS to solve the least squares problems and compare the performance with the standard and modiied approaches.
منابع مشابه
Global least squares solution of matrix equation $sum_{j=1}^s A_jX_jB_j = E$
In this paper, an iterative method is proposed for solving matrix equation $sum_{j=1}^s A_jX_jB_j = E$. This method is based on the global least squares (GL-LSQR) method for solving the linear system of equations with the multiple right hand sides. For applying the GL-LSQR algorithm to solve the above matrix equation, a new linear operator, its adjoint and a new inner product are dened. It is p...
متن کاملParallel Algorithms for Toeplitz Systems
We describe some parallel algorithms for the solution of Toeplitz linear systems and Toeplitz least squares problems. First we consider the parallel implementation of the Bareiss algorithm (which is based on the classical Schur algorithm). The alternative Levinson algorithm is less suited to parallel implementation because it involves inner products. The Bareiss algorithm computes the LU factor...
متن کاملMatrix-Free Polynomial-Based Nonlinear Least Squares Optimized Preconditioning and Its Application to Discontinuous Galerkin Discretizations of the Euler Equations
We introduce a preconditioner that can be both constructed and applied using only the ability to apply the underlying operator. Such a preconditioner can be very attractive in scenarios where one has a highly efficient parallel code for applying the operator. Our method constructs a polynomial preconditioner using a nonlinear least squares (NLLS) algorithm. We show that this polynomial-based NL...
متن کاملData Distribution Analysis of MCGLS Algorithm for Parallel Least Squares Problems
In this paper we mainly study diierent data distribution of MCGLS, a modiied algorithm of CGLS which is a basic iterative method to organize the computation of conjugate gradient method applied to normal equations, for solving sparse least squares problems on massively parallel distributed memory computers. The performance of CGLS on this kind of architecture is always limited because of the gl...
متن کاملFirst-order System Least- Squares for Second-order Partial Diierential Equations: Part Ii. Siam J
analysis of iterative substructuring algorithms for elliptic problems in three dimensions. Least-squares mixed-nite elements for second-order elliptic problems. A least-squares approach based on a discrete minus one inner product for rst order systems. Technical report, Brookhaven National Laboratory, 1994. 4] J. H. Bramble and J. E. Pasciak. Least-squares methods for Stokes equations based on ...
متن کامل